Delaware
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
- Europe > Portugal (0.04)
- Europe > France (0.04)
- (216 more...)
- Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
- Government (1.00)
- Energy (1.00)
- (4 more...)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)
- (4 more...)
Unlocking the Potential of Global Human Expertise
For example, in the Pandemic Response Challenge experiment, the context consisted of data about the geographic region for which the predictions were made, e.g., historical data of COVID-19 cases and intervention policies; actions were future schedules of intervention policies for the region; and outcomes were predicted future cases of COVID-19 along with the stringency
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
- Europe > Portugal (0.04)
- Europe > France (0.04)
- (216 more...)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)
- (4 more...)
- North America > United States > Ohio > Franklin County > Columbus (0.04)
- North America > United States > Ohio > Delaware County > Delaware (0.04)
- North America > United States > Delaware > New Castle County > Newark (0.04)
- (2 more...)
- Information Technology > Security & Privacy (1.00)
- Banking & Finance (1.00)
- Law (0.93)
Deprecating Benchmarks: Criteria and Framework
Joaquin, Ayrton San, Gipiškis, Rokas, Staufer, Leon, Gil, Ariel
As frontier artificial intelligence (AI) models rapidly advance, benchmarks are integral to comparing different models and measuring their progress in different task-specific domains. However, there is a lack of guidance on when and how benchmarks should be deprecated once they cease to effectively perform their purpose. This risks benchmark scores over-valuing model capabilities, or worse, obscuring capabilities and safety-washing. Based on a review of benchmarking practices, we propose criteria to decide when to fully or partially deprecate benchmarks, and a framework for deprecating benchmarks. Our work aims to advance the state of benchmarking towards rigorous and quality evaluations, especially for frontier models, and our recommendations are aimed to benefit benchmark developers, benchmark users, AI governance actors (across governments, academia, and industry panels), and policy makers.
- Europe > Germany > Bavaria > Upper Bavaria > Munich (0.05)
- Europe > Lithuania > Vilnius County > Vilnius (0.04)
- Asia > China (0.04)
- (5 more...)
- Overview (0.68)
- Research Report (0.64)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.68)
- Information Technology > Artificial Intelligence > Issues (0.68)
Temporal Analysis of Climate Policy Discourse: Insights from Dynamic Embedded Topic Modeling
Badekale, Rafiu Adekoya, Akinfaderin, Adewale
Understanding how policy language evolves over time is critical for assessing global responses to complex challenges such as climate change. Temporal analysis helps stakeholders, including policymakers and researchers, to evaluate past priorities, identify emerging themes, design governance strategies, and develop mitigation measures. Traditional approaches, such as manual thematic coding, are time-consuming and limited in capturing the complex, interconnected nature of global policy discourse. With the increasing relevance of unsupervised machine learning, these limitations can be addressed, particularly under high-volume, complex, and high-dimensional data conditions. In this work, we explore a novel approach that applies the dynamic embedded topic model (DETM) to analyze the evolution of global climate policy discourse. A probabilistic model designed to capture the temporal dynamics of topics over time. We collected a corpus of United Nations Framework Convention on Climate Change (UNFCCC) policy decisions from 1995 to 2023, excluding 2020 due to the postponement of COP26 as a result of the COVID-19 pandemic. The model reveals shifts from early emphases on greenhouse gases and international conventions to recent focuses on implementation, technical collaboration, capacity building, finance, and global agreements. Section 3 presents the modeling pipeline, including preprocessing, model training, and visualization of temporal word distributions. Our results show that DETM is a scalable and effective tool for analyzing the evolution of global policy discourse. Section 4 discusses the implications of these findings and we concluded with future directions and refinements to extend this approach to other policy domains.
- Asia > Japan > Honshū > Kansai > Kyoto Prefecture > Kyoto (0.05)
- South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.05)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- (12 more...)
- Law > Environmental Law (1.00)
- Government (1.00)
- Energy > Energy Policy (1.00)
Delta - Contrastive Decoding Mitigates Text Hallucinations in Large Language Models
Huang, Cheng Peng, Chen, Hao-Yuan
Large Language Models (LLMs) have demonstrated remarkable capabilities in natural language processing tasks. Still, they are prone to generating hallucinations--factually incorrect or fabricated content that can undermine their reliability, especially in high-stakes domains such as healthcare and legal advisory. In response to this challenge, we propose Delta, a novel inference-time approach that leverages contrastive decoding to mitigate hallucinations without requiring model retraining or additional training data. Delta works by randomly masking portions of the input prompt, then contrasting the original and masked output distribution generated by the model, effectively mitigating hallucinations through inferenceonly computations. Delta was evaluated on context-rich QA benchmarks like SQuAD v1.1 and v2, achieving around 3 and 6 percentage points of improvement, respectively. It also showed gains of 7 and 2 percentage points on TriviaQA and Natural Question under-sampling decoding. Delta improved SQuAD v2's noanswer exact match by over ten percentage points.
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- North America > United States > Texas > Travis County > Austin (0.04)
- North America > United States > Ohio > Delaware County > Delaware (0.04)
- (7 more...)
Unlocking the Potential of Global Human Expertise
Meyerson, Elliot, Francon, Olivier, Sargent, Darren, Hodjat, Babak, Miikkulainen, Risto
Solving societal problems on a global scale requires the collection and processing of ideas and methods from diverse sets of international experts. As the number and diversity of human experts increase, so does the likelihood that elements in this collective knowledge can be combined and refined to discover novel and better solutions. However, it is difficult to identify, combine, and refine complementary information in an increasingly large and diverse knowledge base. This paper argues that artificial intelligence (AI) can play a crucial role in this process. An evolutionary AI framework, termed RHEA, fills this role by distilling knowledge from diverse models created by human experts into equivalent neural networks, which are then recombined and refined in a population-based search. The framework was implemented in a formal synthetic domain, demonstrating that it is transparent and systematic. It was then applied to the results of the XPRIZE Pandemic Response Challenge, in which over 100 teams of experts across 23 countries submitted models based on diverse methodologies to predict COVID-19 cases and suggest non-pharmaceutical intervention policies for 235 nations, states, and regions across the globe. Building upon this expert knowledge, by recombining and refining the 169 resulting policy suggestion models, RHEA discovered a broader and more effective set of policies than either AI or human experts alone, as evaluated based on real-world data. The results thus suggest that AI can play a crucial role in realizing the potential of human expertise in global problem-solving.
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
- Europe > Portugal (0.04)
- Europe > France (0.04)
- (222 more...)
- Research Report > Promising Solution (0.46)
- Research Report > Experimental Study (0.34)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
Med-IC: Fusing a Single Layer Involution with Convolutions for Enhanced Medical Image Classification and Segmentation
Islam, Md. Farhadul, Zabeen, Sarah, Manab, Meem Arafat, Mahin, Mohammad Rakibul Hasan, Mondal, Joyanta Jyoti, Reza, Md. Tanzim, Hasan, Md Zahidul, Haque, Munima, Sadeque, Farig, Noor, Jannatun
The majority of medical images, especially those that resemble cells, have similar characteristics. These images, which occur in a variety of shapes, often show abnormalities in the organ or cell region. The convolution operation possesses a restricted capability to extract visual patterns across several spatial regions of an image. The involution process, which is the inverse operation of convolution, complements this inherent lack of spatial information extraction present in convolutions. In this study, we investigate how applying a single layer of involution prior to a convolutional neural network (CNN) architecture can significantly improve classification and segmentation performance, with a comparatively negligible amount of weight parameters. The study additionally shows how excessive use of involution layers might result in inaccurate predictions in a particular type of medical image. According to our findings from experiments, the strategy of adding only a single involution layer before a CNN-based model outperforms most of the previous works.
- Asia > Bangladesh > Dhaka Division > Dhaka District > Dhaka (0.04)
- South America > Peru > Lima Department > Lima Province > Lima (0.04)
- North America > United States > Ohio > Delaware County > Delaware (0.04)
- (3 more...)
- Instructional Material (0.87)
- Research Report > New Finding (0.86)
- Health & Medicine > Therapeutic Area (1.00)
- Health & Medicine > Diagnostic Medicine > Imaging (1.00)
Grasping by Hanging: a Learning-Free Grasping Detection Method for Previously Unseen Objects
Li, Wanze, Su, Wan, Chirikjian, Gregory S.
This paper proposes a novel learning-free three-stage method that predicts grasping poses, enabling robots to pick up and transfer previously unseen objects. Our method first identifies potential structures that can afford the action of hanging by analyzing the hanging mechanics and geometric properties. Then 6D poses are detected for a parallel gripper retrofitted with an extending bar, which when closed forms loops to hook each hangable structure. Finally, an evaluation policy qualities and rank grasp candidates for execution attempts. Compared to the traditional physical model-based and deep learning-based methods, our approach is closer to the human natural action of grasping unknown objects. And it also eliminates the need for a vast amount of training data. To evaluate the effectiveness of the proposed method, we conducted experiments with a real robot. Experimental results indicate that the grasping accuracy and stability are significantly higher than the state-of-the-art learning-based method, especially for thin and flat objects.
- North America > United States > Delaware > New Castle County > Newark (0.14)
- Europe > Germany > Baden-Württemberg > Karlsruhe Region > Karlsruhe (0.04)
- North America > United States > Ohio > Delaware County > Delaware (0.04)
- (2 more...)
DataLike: Interview with Motunrayo Kilanko
Motunrayo Kilanko is a seasoned data management and analytics specialist who has worked in the fields of data analysis, data management, and data annotation for machine learning. She works presently as a management analyst with a government healthcare agency in the State of Delaware, United States. She is also an AI enthusiast that teaches women how to use AI for their work and business. Her career interests spans Data, AI, public health, and empowerment of women. She is the founder of Femote, a social impact startup that provides business support and outsourcing services such as data annotation, data processing, and data entry to companies around the world by trained and skilled female professionals from Africa.
- North America > United States > Ohio > Delaware County > Delaware (0.26)
- North America > United States > Delaware (0.26)
- Africa > Nigeria > Oyo State > Ibadan (0.06)